Assessing rule interestingness with a probabilistic measure of deviation from equilibrium

نویسندگان

  • Julien Blanchard
  • Fabrice Guillet
  • Henri Briand
  • Régis Gras
چکیده

Assessing rule interestingness is the cornerstone of successful applications of association rule discovery. In this article, we present a new measure of interestingness named IPEE. It has the unique feature of combining the two following characteristics: first, it is based on a probabilistic model, and secondly, it measures the deviation from what we call equilibrium (maximum uncertainty of the consequent given that the antecedent is true). We study the properties of this new index and show in which cases it is more useful than a measure of deviation from independence.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assessing the interestingness of temporal rules with Sequential Implication Intensity

In this article, we study the assessment of the interestingness of sequential rules (generally temporal rules). This is a crucial problem in sequence analysis since the frequent pattern mining algorithms are unsupervised and can produce huge amounts of rules. While association rule interestingness has been widely studied in the literature, there are few measures dedicated to sequential rules. C...

متن کامل

ION: a pertinent new measure for mining information from many types of data

Since last decade, many methods with appropriate measures are proposed in knowledge discovery in databases. These measures aim at both improving the quality of mined association rules and reducing the problem of many nested rules. This paper presents a new statistical Implication Oriented Normalized measure, denoted ION. ION turns to be a unifying framework for several probabilistic measures of...

متن کامل

Boolean Analyzer - An Algorithm That Uses A Probabilistic Interestingness Measure to find Dependency/Association Rules In A He

A new, binary-based technique is presented for finding dependency/association rules called the Boolean Analyzer (BA). With initial guidance from a domain user or domain expert, BA is given one or more metrics to partition the entire data set. This leads to analyzing the implicit domain knowledge and creating weighted rules in the form of boolean expressions. To augment the analysis of the rules...

متن کامل

A New Probabilistic Measure of Interestingness for Association Rules, Based on the Likelihood of the Link

The interestingness measures for pattern associations proposed in the data mining literature depend only on the observation of relative frequencies obtained from 2×2 contingency tables. They can be called “absolute measures”. The underlying scale of such a measure makes statistical decisions difficult. In this paper we present the foundations and the construction of a probabilistic interestingn...

متن کامل

Relative Measure for Mining Interesting Rules

This paper presents a measure which estimates interestingness of a rule relative to its corresponding common sense rules. Mining interesting rules is one of the important data mining tasks. Interesting rules bring novel knowledge that helps decision makers for advantageous actions. Interestingness is a relative issue. It is relative with what is known about the domain. A measure which can estim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005